As a Data Scientist embedded in the Advertising Organization, you will work with a group of data and non-data Mozillians responsible for understanding and driving the future of the internet. The Data Science team sits at the intersection of product, engineering, finance, business development, marketing, and leadership, and we collaborate closely with these and more partners to empower rigorous decision-making and create impactful data products.
Job listings
As a Pythian Data Scientist, you will be responsible for leveraging your expertise in data analysis, machine learning, and statistical modeling to deliver insights and build predictive models. You will work on both internal IP development and client projects, applying a variety of techniques to solve complex business problems. Additionally, you will use and fine-tune pre-trained models, such as Large Language Models (LLMs), to accelerate development and enhance model performance.
Machine Learning is integral to the continued success of Turnitin. As a Senior Machine Learning Scientist, you will join a global team to deliver cutting-edge Machine Learning systems, working closely with product and engineering teams to integrate Machine Learning into a broad suite of learning, teaching, and integrity products.
Build production machine learning models that identify fraud. Write production and offline analytical code in Python. Work with distributed data pipelines, communicating complex ideas effectively to a variety of audiences. Collaborate with engineering teams to strengthen our machine-learning platform, using your analytical toolbox to surface insights into real-time fraud attacks.
Lead the end-to-end execution of LLM training projects involving Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback (RLHF), and Reinforcement Learning from Execution Feedback (RLEF). Lead a cross-functional team of AI Trainers, Leads, and Engineering Managers to ensure the delivery of high-quality data and model improvements. Serve as the senior-most researcher accountable for strategic client alignment, throughput, quality, and operational efficiency across multiple programming languages (Python, JavaScript, Java, etc.).